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The MAILING DATE of this communication appears on the cover sheet with the correspondence address • 
Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1.136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 1 33). 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1)n Responsive to communication(s) filed on , 

2a)n This action is FINAL. 2b)K This action is non-final. 

3) n Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 11, 453 O.G. 213. 

Disposition of Claims 

4) ^ Claim(s) 1-31 is/are pending in the application. 

4a) Of the above claim(s) is/are withdrawn from consideration. 

5) 0 Claim(s) is/are allowed. 

6) S Claim(s) 1-31 is/are rejected. 

7) 0 Claim(s) is/are objected to. 

8) n Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) S The specification is objected to by the Examiner. 

10)^ The drawing(s) filed on 21 June 2001 is/are: a)^ accepted or b)n objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1.85(a). 

Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 
11 )□ The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-152. 

Priority under 35 U.S.C. § 119 

1 2)n Acknowledgment is made of a claim for foreign priority under 35 U.S.C, § 1 1 9(a)-(d) or (f). 
a)n All b)n Some * c)^ None of: 

1 .□ Certified copies of the priority documents have been received. 

2.n Certified copies of the priority documents have been received in Application No. . 




3.n Copies of the certified copies of the priority documents have been received in this National Stage 
application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 
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DETAILED ACTION 



Specification 

1 . The specification is objected to because the arrangement of the disclosed 

application does not conform with 37 CFR 1 .77(b). 

Section heading appear underlined and in lowercase format throughout the 
disclosed specification. Section headings should not be underlined , and should 
appear in UPPERCASE format Appropriate corrections are required according to 
the guidelines provided below: 



2. The following guidelines illustrate the preferred layout for the specification of a 

utility application. These guidelines are suggested for the applicant's use. 

Arrangement of the Specification 
As provided in 37 CFR 1.77(b), the specification of a utility application should 
include the following sections in order. Each of the lettered items should appear in 
upper case, without underlining or bold type, as a section heading. If no text follows the 
section heading, the phrase "Not Applicable" should follow the section heading: 

(a) TITLE OF THE INVENTION. 

(b) CROSS-REFERENCE TO RELATED APPLICATIONS. 

(c) STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR 

DEVELOPMENT. 

(d) INCORPORATION-BY-REFERENCE OF MATERIAL SUBMITTED ON A 

COMPACT DISC (See 37 CFR 1.52(e)(5) and MPEP 608.05. Computer 
program listings (37 CFR 1.96(c)), "Sequence Listings" (37 CFR 1.821(c)), 
and tables having more than 50 pages of text are permitted to be 
submitted on compact discs.) or 

REFERENCE TO A "MICROFICHE APPENDIX" (See MPEP § 608.05(a). 
"Microfiche Appendices" were accepted by the Office until March 1 , 2001 .) 

(e) BACKGROUND OF THE INVENTION. 

(1) Field of the Invention. 



Application/Control Number: 09/886,771 



Pages 



Art Unit: 2164 



(2) Description of Related Art including information disclosed under 37 
CFR 1.97 and 1.98. 

(f) BRIEF SUMMARY OF THE INVENTION. 

(g) BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING(S). 

(h) DETAILED DESCRIPTION OF THE INVENTION. 

(i) CLAIM OR CLAIMS (commencing on a separate sheet). 

(j) ABSTRACT OF THE DISCLOSURE (commencing on a separate sheet). 

(k) SEQUENCE LISTING (See MPEP § 2424 and 37 CFR 1.821-1.825. A 
"Sequence Listing" is required on paper if the application discloses a 
nucleotide or amino acid sequence as defined in 37 CFR 1.821(a) and if 
the required "Sequence Listing" is not submitted as an electronic 
document on compact disc). 



3. The abstract of the disclosure is objected to because it contains more than 150 
words. 

Correction is required. See MPEP § 608.01(b). 

Applicant is reminded of the proper language and format for an abstract of the 
disclosure. 

The abstract should be in narrative form and generally limited to a single 
paragraph on a separate sheet within the range of 50 to 1 50 words. It is important that 
the abstract not exceed 150 words in length since the space provided for the abstract 
on the computer tape used by the printer is limited. The form and legal phraseology 
often used in patent claims, such as "means" and "said," should be avoided. The 
abstract should describe the disclosure sufficiently to assist readers in deciding whether 
there is a need for consulting the full patent text for details. 

The language should be clear and concise and should not repeat information 
given in the title. It should avoid using phrases which can be implied, such as, "The 
disclosure concerns," "The disclosure defined by this invention," "The disclosure 
describes," etc. 



Claim Rejections - 35 USC § 102 
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4. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made In this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

5. Claims 1-31 are rejected under 35 U.S.C. 102(b) as being anticipated by 
Favvad et al. (PCT Pub No. WO 99/62007). 

As to claim 1 , Favvad et al. teaches a method for clustering data in a database 
comprising: 

a) providing a database having a number of data records having both discrete 
and continuous attributes (see page 7, lines 4-6); 

b) grouping together data records from the database which have specified 
discrete attribute configurations (see page 8, lines 5 through page 9, lines 1-13; and 
see Table 1 and "Cluster AttributeA/alue Probability Tables"); 

c) clustering data records having the same or similar specified discrete attribute 
configuration based on the continuous attributes to produce an intermediate set of 
data clusters (see page 11, line 42 through page 12, line 32); and 

d) merging together clusters from the intermediate set of data clusters to produce 
a clustering model (see page 14, lines 26-28; and see figures 8A-8D). 



As to claims 2, 9, and 23, Favvad et al. teaches wherein the clustering model 
includes a table of probabilities for the discrete data attributes of the data records for 
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a cluster and wherein the cluster model for continuous data attributes comprises a 
mean and a covariance for each cluster lines (see claim lb). 

As to claims 3, 14, and 24, Fayyad et al. teaches wherein the process of merging 
of intermediate clusters is ended when a specified number of clusters has been 
formed (see page 8, lines 12-14, where "specified number of clusters" is read on 
"initial cluster number K=3"; and see claim 14, where "specified number of clusters" 
is read on "K clusters"). 

As to claims 4 and 25, Fayyad et al. teaches wherein the step of merging of 
intermediate clusters is ended when a distance between intermediate clusters is 
greater than a specified minimum distance (see page 27, line 12 through page 28, 
line 26, where "distance between intermediate clusters" is read on "stopping criteria" 
and "specified minimum distance" is read on "the sum of these two numbers" and 
"the sum of these numbers"). 

As to claims 5 and 26, Fayyad et al. teaches wherein the discrete attributes are 
Boolean and similarity between configurations is based on a distance between bit 
patterns of the discrete attributes (see page 33 where "Boolean" and "bit patterns" is 
read on "0/1 assignments"). 
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As to claims 6 and 20, Fa wad et al. teaches wherein one or more of the discrete 
attributes have more than two possible values and comprising the step of 
subdividing a discrete attribute having more than two possible values into multiple 
Boolean value attributes (see page 33 where "Boolean" and "two possible values" is 
read on "0/1 assignments"). 

As to claims 7 and 27, Favvad et aL teaches wherein the step of identifying 
configurations includes tabulating data records having the same discrete attribute bit 
pattern and combining the data records from similar configurations before clustering 
the data records so tabulated based on the continuous attributes (see page 33 
where "bit pattern" is read on "0/1 assignments). 

As to claim 8, Favvad et al. teaches a method for clustering data in a database 
comprising: 

a) providing a database having a number of data records having both discrete 
and continuous attributes (see page 14, line 32 through page 15, line 2); 

b) counting data records from the database which have the same discrete 
attribute configuration and identifying a first set of configurations wherein the number 
of data records of each configuration of the first set of configurations exceeds a 
threshold number of data records (see page 15, line 21 through page 16, line 15, 
where "counting data records" is read on "counting the number of data records" and 
"exceeds a threshold number of data records" is read on "stopping criteria"); 
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c) adding data records from the database not belonging to one of the first set of 
configurations with a configuration within the first set of configurations to produce a 
subset of records from the database belonging to configurations in the first set of 
configurations (see page 15, lines 12-18, where "subset of records" is read on 
"compressed data"); and 

d) clustering the subset of records contained within at least some of the first set 
of configurations based on the continuous data attributes of records contained within 
that first set of configurations to produce a clustering model (see page 15, lines 19- 
27, where "continuous data attributes" is read on "ordered attributes"). 

As to claim 10, Fa wad et al. teaches wherein an added record not contained 
within the first set of configurations is added to one of the first set of configurations 
based on a distance between a smaller configuration to which the added record 
belongs during counting of records in different configurations (see page 15, line 24- 
25, where "counting" is read on "'M' counting"). 

As to claims 1 1 and 28, Fa wad et al. teaches wherein the clustering of records 
from a configuration based on continuous data attributes results in a variable 
number of clusters for each configuration based on the number of records in the 
configuration (see page 15, lines 19-32, where "continuous data attributes" is read 
on "ordered attributes"; and where "variable number of clusters" is read on "scalable 
clustering process"). 
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As to claim 12, Favyad et al. teaches wherein the clustering of records from 
records falling within a configuration of the first set results in a number of 
intermediate clusters which are merged together to form the cluster model (see page 
18, lines 23-31, where "records falling with a configuration" is read on "data points 
falling within a given cluster"). 

As to claim 13, Favvad et al. teaches wherein intermediate clusters are merged 
together based on a distance between clusters that is determined based on both 
continuous and discrete attributes of the intermediate clusters (see page 4, line 20 
through page 5, line 4, where "clusters are merged" is read on "membership of a 
given record in a particular cluster"; and see page 19, lines 1-7, where "distance 
between clusters" is read on "sufficiently 'close' to an existing CS subcluster"). 

As to claim 15, Favvad et al. teaches wherein the merging of intermediate 
clusters is performed until a distance between two closest clusters is greater than a 
threshold distance (see page 19, line 25 through page 20, line 2). 

As to claims 16 and 29, Favvad et al. teaches wherein a list of records of each 
configuration in the first set of configurations is maintained as data records are 
accessed from the database (see page 8, lines 5 through page 9, lines 1-13; and 
see Table 1 and "Cluster AttributeA/alue Probability Tables"). 
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As to claims 17 and 30, Favvad et al. teaches where the clustering based on the 
continuous attributes of records within a configuration is performed using 
expectation maximization clustering of the continuous attributes (see page 4, line 20 
through page 5, line 4). 

As to claim 18, Favvad et al. teaches a data processing system comprising: 

a) a storage medium for storing a database having a number of data records 
having both discrete and continuous attributes (see page 7, lines 4-9); 

b) a computer for evaluating data records from the database and building a 
clustering model that describes data in the database (see page 7, lines 1-5); and 

c) a database management system including a component for selectively 
retrieving data records from the database for evaluation by the computer (see page 
7, lines 9-1 1 , where "retrieving data records" is read on "brings data from the 
database"); 

For the remaining steps of this claim, the applicant is directed to remarks and 
discussions made in claim 1 above. 

As to claim 19, Favvad et al. teaches wherein the computer includes a rapid 
access storage for maintaining a list of data records from the database for data 
records having a specified discrete attribute configuration to facilitate clustering of 
the data records based on their continuous attributes (see page 5, lines 5-8). 
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As to claim 21 , Fa wad et al. teaches wherein the rapid access storage of the 
computer includes a data structure for storing a clustering model (see figures 8A- 
8D). 

As to claim 22, Favvad et al. teaches a computer readable medium containing 
stored instructions for clustering data in a database comprising instructions for (see 
page 7, lines 1-11): 

a) reading records from a database having a number of data records having both 
discrete and continuous attributes (see page 7, lines 4-1 1 , where "reading records" 
is read on "brings data from the database"); 

For the remaining steps of this claim, the applicant is directed to remarks and 
discussions made in claim 1 above. 

As to claim 31, Favvad et al. teaches where records are assigned to a single 
cluster during the expectation maximization clustering process (see page 4, lines 26- 
31; and see claim 24). 

Conclusion 

6. Any inquiries concerning this communication or earlier communications from the 
examiner should be directed to Patricia C. Zicht, whose telephone number is (571) 272- 
5866. The examiner can normally be reached on Mondays-Fridays from 07:30 am to 
04:00 pm. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Dov Popovici, can be reached at (571) 272-4083. 

pcz 

December 7, 2004 




SAM RiMELL 
PRIMARY EXAMINER 



